An Overview of Optical Character Recognition Systems Research on Telugu Language

Authors

  • D Jayaram
  • CRK Reddy
  • Kamakshi Prasad
Abstract

This paper gives an overview of the development process and ongoing research on optical character recognition (OCR) systems for Telugu text. Its aim is to provide a starting point for researchers entering this field. We present an introduction, the characteristics of the Telugu language, the development process of OCR systems for Telugu, the research done on Telugu script recognition, and the scope for future work on Telugu OCR systems.

Keywords: OCR, segmentation, feature extraction, connected component (CC), classification.


Similar resources

Optical Character Recognition (OCR) for Telugu: Database, Algorithm and Application

Telugu is a Dravidian language spoken by more than 80 million people worldwide. Optical character recognition (OCR) of the Telugu script has wide-ranging applications, including education, healthcare, and administration. The beautiful Telugu script, however, is very different from Germanic scripts like English and German. This makes the use of transfer learning of Germanic OCR solutions to Te...


Multi-font Optical Character Recognition System for Printed Telugu Text

The Telugu OCR systems currently available on the market recognize only specific Telugu fonts. This paper describes the development of a multi-font OCR system for printed Telugu characters using artificial neural networks. In this system, the characters are classified using a multilayer neural network architecture.


Efficient Recognition of Telugu Characters Based on Critical Points Generated Using Morphological Methods

A novel method for the recognition of Telugu characters is proposed in this paper. The proposed method extracts critical points of the characters based on grid and radial intersection analysis. The extracted critical points are classified based on the grid and radial lines, which helps improve character recognition accuracy. The algorithm is tested on various data sets and the...


Classification and Identification of Telugu Handwritten Characters Extracted from Palm Leaves Using Decision Tree Approach

Research in character recognition is very popular because of its potential applications in banks, post offices, defense organizations, reading aids for the blind, library automation, language processing, and multimedia design. Even though epigraphical works dealing with stone inscriptions have been analyzed, the analysis has been done largely manually and on 2D traces. A large collection of these are...


OCR of Printed Telugu Text with High Recognition Accuracies

Telugu is one of the oldest and most popular languages of India, spoken by more than 66 million people, especially in South India. The development of optical character recognition systems for Telugu text is an area of current research. OCR of Indian scripts is much more complicated than OCR of the Roman script because of the huge number of combinations of characters and modifiers. Basic Symbols are...




Publication date: 2012